AITopics | learning generative model

Asymptotic Guarantees for Learning Generative Models with the Sliced-Wasserstein Distance

Neural Information Processing Systems | Dec-25-2025, 23:18:31 GMT

Minimum expected distance estimation (MEDE) algorithms have been widely used for probabilistic models with intractable likelihood functions and they have become increasingly popular due to their use in implicit generative modeling (e.g., Wasserstein generative adversarial networks, Wasserstein autoencoders). Emerging from computational optimal transport, the Sliced-Wasserstein (SW) distance has become a popular choice in MEDE thanks to its simplicity and computational benefits. While several studies have reported empirical success on generative modeling with SW, the theoretical properties of such estimators have not yet been established. In this study, we investigate the asymptotic properties of estimators that are obtained by minimizing SW. We first show that convergence in SW implies weak convergence of probability measures in general Wasserstein spaces. Then we show that estimators obtained by minimizing SW (and also an approximate version of SW) are asymptotically consistent. We finally prove a central limit theorem, which characterizes the asymptotic distribution of the estimators and establishes a convergence rate of $\sqrt{n}$, where $n$ denotes the number of observed data points. We illustrate the validity of our theory on both synthetic data and neural networks.
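
As a rough illustration of the idea in this abstract, the sketch below estimates a location parameter by minimizing a Monte Carlo approximation of the SW distance between observed data and model samples. It is not the authors' code: the function name `sliced_wasserstein`, the Gaussian toy model, the grid search, and all parameter values are assumptions chosen for illustration.

```python
import numpy as np


def sliced_wasserstein(x, y, n_projections=200, p=2, seed=0):
    """Monte Carlo estimate of the order-p SW distance between two
    samples x and y, both of shape (n, d) with the same n."""
    rng = np.random.default_rng(seed)
    # Uniform directions on the unit sphere: normalized Gaussian draws.
    theta = rng.normal(size=(n_projections, x.shape[1]))
    theta /= np.linalg.norm(theta, axis=1, keepdims=True)
    # Project both samples onto each direction and sort along the sample axis.
    x_proj = np.sort(x @ theta.T, axis=0)
    y_proj = np.sort(y @ theta.T, axis=0)
    # For equal-size 1D samples, the optimal coupling matches sorted values.
    wp_p = np.mean(np.abs(x_proj - y_proj) ** p, axis=0)
    # Average the 1D costs over directions, then take the p-th root.
    return float(np.mean(wp_p) ** (1.0 / p))


# Toy setup (assumed): data come from a 2D Gaussian shifted by 1.5.
rng = np.random.default_rng(42)
n, d, true_shift = 2000, 2, 1.5
data = rng.normal(size=(n, d)) + true_shift

# Model samples reuse the same base noise for every candidate shift,
# so the loss varies smoothly with the parameter.
base_noise = rng.normal(size=(n, d))

grid = np.linspace(0.0, 3.0, 31)
losses = [sliced_wasserstein(data, base_noise + m) for m in grid]
best_shift = float(grid[int(np.argmin(losses))])
print(f"estimated shift: {best_shift:.2f} (true value {true_shift})")
```

A grid search is used here only to keep the example short; in practice a gradient-based optimizer over the model parameters would be the more usual choice.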

asymptotic guarantee, generative model, learning generative model, (7 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.97)

Learning Generative Models with Visual Attention

Charlie Tang, Nitish Srivastava, Russ R. Salakhutdinov

Neural Information Processing Systems | Oct-2-2025, 20:59:20 GMT

Neural Information Processing Systems http://nips.cc/

learning generative model, visual attention

Technology: Information Technology > Artificial Intelligence > Natural Language > Generation (0.40)

Learning Generative Models with Visual Attention

Neural Information Processing Systems | Sep-30-2025, 09:08:34 GMT

Attention has long been proposed by psychologists to be important for efficiently dealing with the massive amounts of sensory stimuli in the neocortex. Inspired by the attention models in visual neuroscience and the need for object-centered data for generative models, we propose a deep-learning-based generative framework using attention. The attentional mechanism propagates signals from the region of interest in a scene to an aligned canonical representation for generative modeling. By ignoring scene background clutter, the generative model can concentrate its resources on the object of interest. A convolutional neural net is employed to provide good initializations during posterior inference, which uses Hamiltonian Monte Carlo. Upon learning images of faces, our model can robustly attend to the face region of novel test subjects. More importantly, our model can learn generative models of new faces from a novel dataset of large images where the face locations are not known.

learning generative model, name change, visual attention, (2 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.61)

Learning Generative Models with Visual Attention

Charlie Tang, Nitish Srivastava, Russ R. Salakhutdinov

Neural Information Processing Systems | Feb-9-2025, 19:22:16 GMT

Attention has long been proposed by psychologists to be important for efficiently dealing with the massive amounts of sensory stimuli in the neocortex. Inspired by the attention models in visual neuroscience and the need for object-centered data for generative models, we propose a deep-learning-based generative framework using attention. The attentional mechanism propagates signals from the region of interest in a scene to an aligned canonical representation for generative modeling. By ignoring scene background clutter, the generative model can concentrate its resources on the object of interest. A convolutional neural net is employed to provide good initializations during posterior inference, which uses Hamiltonian Monte Carlo. Upon learning images of faces, our model can robustly attend to the face region of novel test subjects. More importantly, our model can learn generative models of new faces from a novel dataset of large images where the face locations are not known.

artificial intelligence, machine learning, natural language, (19 more...)

Country: North America > Canada > Ontario > Toronto (0.14)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Generation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.47)

Reviews: Asymptotic Guarantees for Learning Generative Models with the Sliced-Wasserstein Distance

Neural Information Processing Systems | Jan-27-2025, 01:17:48 GMT

Clarity: the article is clear and well written. In this aspect the paper is an "accept" for me. This is an accept as well (6).
Quality: this paper is of high quality; it is clear there is a significant research effort behind it. The combination "theoretical results + empirical validation in simple cases" is sensible given the type of paper this is, and the audience. Accept too (6).
Originality: this is the item where I tend to reject more than to accept (5). I think it is definitely original, but all the theoretical contributions seem to me a bit marginal: I am very familiar with Bernton et al. 2018, the paper that develops the technique (in turn, mainly based on Bassetti et al. 2006 and Pollard 1980) that is used here.

asymptotic guarantee, learning generative model, sliced-wasserstein distance

Technology: Information Technology > Artificial Intelligence > Natural Language > Generation (0.40)

Reviews: Asymptotic Guarantees for Learning Generative Models with the Sliced-Wasserstein Distance

Neural Information Processing Systems | Jan-27-2025, 01:17:36 GMT

The reviewers liked the paper and voted for an accept, which was confirmed following the authors' feedback. But the discussion highlighted the fact that the results do not address the problem of sampling on the unit sphere, which has to be done when actually learning generative models. This sampling will probably add some variance in practice and should at least be discussed in the final paper and investigated in future work.
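
The variance concern raised here can be made concrete with a small experiment. The sketch below is an illustration only, not taken from the paper or the reviews: the helper `sw2_monte_carlo`, the Gaussian toy data, and the projection counts are all assumed. It repeats a Monte Carlo estimate of the order-2 SW distance with fresh random directions and compares the spread of the estimates for a small and a large number of projections.

```python
import numpy as np


def sw2_monte_carlo(x, y, n_projections, rng):
    """Order-2 SW estimate from n_projections random unit directions."""
    # Normalized Gaussian draws are uniform on the unit sphere.
    theta = rng.normal(size=(n_projections, x.shape[1]))
    theta /= np.linalg.norm(theta, axis=1, keepdims=True)
    # Sorted 1D projections give the optimal matching per direction.
    x_proj = np.sort(x @ theta.T, axis=0)
    y_proj = np.sort(y @ theta.T, axis=0)
    # Mean over samples and directions, then square root.
    return float(np.sqrt(np.mean((x_proj - y_proj) ** 2)))


# Two fixed samples in 10 dimensions that differ along one axis (assumed toy data).
data_rng = np.random.default_rng(0)
x = data_rng.normal(size=(500, 10))
y = data_rng.normal(size=(500, 10))
y[:, 0] += 2.0

spread = {}
for n_projections in (10, 500):
    proj_rng = np.random.default_rng(1)
    # Only the random directions change between repetitions; the data are fixed.
    estimates = [sw2_monte_carlo(x, y, n_projections, proj_rng) for _ in range(30)]
    spread[n_projections] = float(np.std(estimates))
    print(f"{n_projections} projections: std of estimate = {spread[n_projections]:.4f}")
```

Because the data are held fixed, any spread in the printed values comes only from the choice of directions, which is the extra source of randomness the discussion refers to.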

asymptotic guarantee, learning generative model, sliced-wasserstein distance

Technology: Information Technology > Artificial Intelligence > Natural Language > Generation (0.74)

Learning Generative Models with Visual Attention

Neural Information Processing Systems | Jan-18-2025, 08:28:01 GMT

Attention has long been proposed by psychologists to be important for efficiently dealing with the massive amounts of sensory stimuli in the neocortex. Inspired by the attention models in visual neuroscience and the need for object-centered data for generative models, we propose a deep-learning-based generative framework using attention. The attentional mechanism propagates signals from the region of interest in a scene to an aligned canonical representation for generative modeling. By ignoring scene background clutter, the generative model can concentrate its resources on the object of interest. A convolutional neural net is employed to provide good initializations during posterior inference, which uses Hamiltonian Monte Carlo.

learning generative model, visual attention

Technology:

Information Technology > Artificial Intelligence > Natural Language > Generation (0.98)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.65)

Asymptotic Guarantees for Learning Generative Models with the Sliced-Wasserstein Distance

Neural Information Processing Systems | Oct-10-2024, 22:25:45 GMT

Minimum expected distance estimation (MEDE) algorithms have been widely used for probabilistic models with intractable likelihood functions and they have become increasingly popular due to their use in implicit generative modeling (e.g., Wasserstein generative adversarial networks, Wasserstein autoencoders). Emerging from computational optimal transport, the Sliced-Wasserstein (SW) distance has become a popular choice in MEDE thanks to its simplicity and computational benefits. While several studies have reported empirical success on generative modeling with SW, the theoretical properties of such estimators have not yet been established. In this study, we investigate the asymptotic properties of estimators that are obtained by minimizing SW. We first show that convergence in SW implies weak convergence of probability measures in general Wasserstein spaces.

asymptotic guarantee, learning generative model, sliced-wasserstein distance, (4 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.82)

Learning Generative Models with Visual Attention

Neural Information Processing Systems | Mar-13-2024, 12:33:31 GMT

Attention has long been proposed by psychologists to be important for efficiently dealing with the massive amounts of sensory stimuli in the neocortex. Inspired by the attention models in visual neuroscience and the need for object-centered data for generative models, we propose a deep-learning-based generative framework using attention. The attentional mechanism propagates signals from the region of interest in a scene to an aligned canonical representation for generative modeling. By ignoring scene background clutter, the generative model can concentrate its resources on the object of interest. A convolutional neural net is employed to provide good initializations during posterior inference, which uses Hamiltonian Monte Carlo. Upon learning images of faces, our model can robustly attend to the face region of novel test subjects. More importantly, our model can learn generative models of new faces from a novel dataset of large images where the face locations are not known.

approximate inference, generative model, inference, (15 more...)

Country: North America > Canada > Ontario > Toronto (0.14)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Generation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.47)

Learning Generative Models for Climbing Aircraft from Radar Data

Nick Pepper, Marc Thomas

arXiv.org Artificial Intelligence | Sep-26-2023

Accurate trajectory prediction (TP) for climbing aircraft is hampered by the presence of epistemic uncertainties concerning aircraft operation, which can lead to significant misspecification between predicted and observed trajectories. This paper proposes a generative model for climbing aircraft in which the standard Base of Aircraft Data (BADA) model is enriched by a functional correction to the thrust that is learned from data. The method offers three features: predictions of the arrival time with 66.3% less error when compared to BADA; generated trajectories that are realistic when compared to test data; and a means of computing confidence bounds at minimal computational cost.

aircraft type, dataset, trajectory, (13 more...)

2309.14941

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Georgia > Clayton County (0.04)
Europe > United Kingdom > England > Greater London > London (0.04)
Europe > Serbia > Central Serbia > Belgrade (0.04)

Genre: Research Report (0.64)

Industry:

Transportation > Infrastructure & Services (1.00)
Transportation > Air (1.00)
Aerospace & Defense > Aircraft (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)
Information Technology > Data Science > Data Mining (0.68)
Information Technology > Artificial Intelligence > Natural Language > Generation (0.64)
